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Abstract 

Parallel  computations  often  yield  computation  structures  which  ire  trees;  the  shape  of  such 
a  tree  evolves  over  time  as  the  computation  progresses.  However,  parallel  computers  are 
usually  designed  as  networks  of  processors  with  fixed  connections;  it  is  therefore  important 
to  embed  the  dynamic  structure  of  a  computation  efficiently  in  a  fixed  network.  We 
consider  the  problem  of  dynamically  embedding  an  evolving  binary  tree  with  at  most  N 
nodes  in  an  N-node  hypercube.  We  present  a  simple  randomized  algorithm  which  uses 
only  local  control  and  guarantees  constant  dilation,  while  maintaining  constant  load  with 
high  probability;  this  is  the  first  load-balancing  algorithm  which  achieves  constant  dilation. 
We  also  prove  that  random  solutions  to  this  problem  are  highly  desirable,  by  demonstrating 
that  any  deterministic  embedding  algorithm  which  maintains  constant  load  must  have 
0(7  log  N)  dilation.  *— 
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Abstract 


Parallel  computations  often  yield  computation  structures  which  are  trees;  the  shape 
of  such  a  tree  evolves  over  time  as  the  computation  progresses.  However,  parallel 
computers  are  usually  designed  as  networks  of  processors  with  fixed  connections;  it 
is  therefore  important  to  embed  the  dynamic  structure  of  a  computation  efficiently 
in  a  fixed  network.  We  consider  the  problem  of  dynamically  embedding  an  evolving 
binary  tree  with  at  most  N  nodes  in  an  JV-node  hypercube.  We  present  a 'simple 
randomized  algorithm  which  uses  only  local  control  and  guarantees  constant  dilation, 
while  maintaining  constant  load  with  high  probability;  this  is  the  first  load-balancing 
algorithm  which  achieves  constant  dilation.  We  also  prove  that  random  solutions  to 
this  problem  are  highly  desirable,  by  demonstrating  that  any  deterministic  embedding 
algorithm  which  maintains  constant  load  must  have  fl(v/log  iV)  dilation. 


X  Introduction 

Solving  large  problems  efficiently  in  parallel  sometimes  leads  to  a  computation  which  has  a 
tree  structure.  Examples  of  this  are  divide-and-conquer  and  branch-and- bound  computations 
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[KZ].  In  general,  the  resulting  computation  tree  is  dynamic  in  that  the  tree  grows  and  shrinks 
at  its  leaves  in  an  unpredictable  way. 

However,  large  parallel  computers  are  built  with  a  fixed  set  of  connections:  the  neighbors 
of  a  processor  do  not  change  over  time.  Therefore  we  must  map  the  nodes  of  a  dynamic 
tree  to  the  processors  of  a  fixed-connection  network.  In  order  to  insure  fast  communication 
between  tree  nodes,  adjacent  nodes  in  the  tree  should  be  mapped  to  processors  which  are 
near  each  other:  in  order  to  insure  that  full  advantage  is  taken  of  the  available  processing 
power,  the  nodes  of  the  tree  should  be  spread  out  evenly  in  the  network  so  that  no  single 
processor  has  to  simulate  disproportionately  many  tree  nodes.  This  leads  us  to  the  question 
of  how  to  efficiently  embed  dynamic  trees  in  a  fixed-connection  network. 

In  this  paper  we  consider  the  problem  of  embedding  dynamically  growing  rooted  binary 
trees  in  a  hvpercube  network.  There  are  several  measures  of  the  efficiency  of  an  embedding. 
One  is  dilation ,  defined  to  be  the  maximum  distance  in  the  hypercube  between  two  nodes 
which  are  adjacent  in  the  embedded  tree.  Low  dilation  m^ans  that  each  tree  node  is  Hose 
to  its  parent,  facilitating  communication  between  them.  Another  important  measure  is  the 
load  of  an  embedding,  which  is  the  maximum  number  of  tree  nodes  mapped  to  a  single 
node  of  the  hvpercube.  A  good  embedding  will  have  a  load  which  is  low,  implying  an  even 
distribution  of  tree  nodes  and  therefore  efficient  use  of  the  available  processors.  Note  that 
this  problem  has  been  solved  any  fixed  binary  tree,  as  it  is  known  that  any  ^V-node  binary 
tree  can  be  embedded  one-to-one  in  an  Af-node  hvpercube  with  constant  dilation  [BCLRj; 
our  results  show  that  these  properties  cam  be  maintained  as  the  tree  evolves. 

In  this  paper  we  present  a  simple  randomized  algorithm  to  embed  a  dynamically  evolving 
binary  tree  with  at  most  N  nodes  in  an  A''- node  hypercube.  The  algorithm  is  distributed  and 
requires  only  local  control;  constant  dilation  is  guaranteed,  and  with  high  probability,  the 
resulting  embedding  will  also  maintain  constant  load.  This  work  is  closely  related  to  recent 
work  of  Bhatt  and  Cai  [BC];  in  their  paper  they  present  a  randomized  algorithm  for  this 
problem  which  guarantees  0(loglog  N)  dilation,  and  achieves  constant  load  for  O(N)- node 
trees  with  high  probability.  Though  at  first  glance  the  algorithm  in  this  paper  may  seem 
nearly  identical  to  that  of  Bhatt  and  Cai,  subtle  differences  in  the  use  of  randomness  lead  to 
a  provably  stronger  result.  In  addition,  we  prove  that  no  deterministic  embedding  algorithm 
which  maintains  constant  load  can  achieve  dilation  better  than  fl( \/log  N),  even  for  trees 
which  only  grow.  This  emphasiz  e,  t*ie  desirability  of  randomized  solutions  to  this  problem. 

The  rest  of  the  paper  is  organic  is  follows.  Section  2  contains  preliminaries  and  defini¬ 
tions  along  with  a  description  of  the  embedding  algorithm.  Section  3  contains  the  analysis 
of  our  algorithm.  Section  4  presents  the  negative  result  for  deterministic  algorithms. 


2  Preliminaries 

Let  T  be  a  rooted  binary  tree  which  is  growing  and  shrinking  over  time.  By  this  we  mean 
that  in  each  time  step  some  nodes  of  T  decide  to  grow  children  and  some  leaves  cease  to 
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exist.  Initially,  T  consists  of  only  the  root.  We  assume  that  T  never  has  more  than  N  nodes 
and  that  it  grows  and  shrinks  over  a  period  of  time  whose  length  is  polynomial  in  N. 

Let  Hn  denote  the  n-dimensional  hypercube  with  N  =  2"  nodes,  represented  by  the  set  of 
all  strings  in  {0, 1}".  The  n2n_1  edges  of  Hn  connect  nodes  with  strings  differing  in  precisely 
one  bit.  The  dimension  of  an  edge  is  the  bit  position  (0  to  n  —  1)  in  which  the  incident  nodes 
differ.  If  x  €  Hn,  then  x*  refers  to  the  node  adjacent  to  x  and  differing  from  it  in  the  itA 
dimension  bit. 

An  embedding  of  T  into  Hn  is  a  map  4>  :  T  *-*  Hn  that  maps  nodes  of  T  to  nodes  of 
Hn.  For  two  hypercube  nodes  x  and  y,  let  d(x,y)  be  the  Hamming  distance  from  x  to  y; 
i.e.  the  number  of  bit  positions  where  x  and  y  differ.  The  dilation  of  an  embedding  is 
max{d(<^(u),<£(u/))},  where  the  maximum  ranges  over  all  v  and  w  which  are  neighbors  in 
the  tree.  The  load  of  an  embedding  equals  max{<f>~l(x)\x  €  Hn) ,  the  maximum  number  of 
nodes  in  T  mapped  to  a  single  node  of  Hn. 

For  any  node  x  €  Hn ,  the  star  centered  at  x  is  the  set  of  nodes  consisting  of  x  itself  and  all 
neighbors  of  x.  Each  star  contains  n  +  1  nodes.  We  present  an  algorithm  which  dynamically 
embeds  the  tree  T  in  Hn  such  that  all  stars  have  0(n)  tree  nodes  mapped  to  them.  We  will 
then  show  how  to  slightly  alter  this  embedding  (dynamically)  so  that  the  resulting  load  is 
constant.  Throughout,  the  dilation  will  be  constant. 

Fix  an  arbitrary  constant  b.  (For  ease  of  exposition,  we  assume  that  b  divides  n.)  Call 
the  distance  from  a  tree  node  r  to  the  root  the  depth  of  v.  The  embedding  algorithm  maps 
tree  neighbors  to  hypercube  nodes  which  are  at  distance  at  most  b. 


for  each  node  v  which  grows  a  child  w 
y  *-  d>(v) 

let  v  be  at  depth  i 

for  j  *—  ib  (mod  n)  to  (t  +  1)6  —  l(mod  n) 
y  *—  yJ  with  probability  5 
4>{w)  <-  y 


Figure  1:  The  Embedding  Algorithm 


3  Analysis  of  the  Algorithm 

We  first  discuss  the  case  in  which  T  is  only  growing.  We  will  show  later  that  we  can  handle 
a  tree  which  shrinks  as  well. 

Theorem.  If  a  growing  tree  T  with  at  most  N  nodes  is  dynamically  embedded  in  the 
hypercube  Hn  according  to  the  rule  given  above  with  b  >  8,  then  with  high  probability  no 
more  than  0(n )  tree  nodes  are  mapped  into  any  star  of  Hn. 
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Figure  2:  An  Extended  Tree 

Fix  a  node  x0  €  Hn.  Call  a  node  v  6  T  a  star  node  if  <t>(v)  lies  in  the  star  centered  at  x0. 
VVe  will  prove  that  with  high  probability  there  are  only  0{n )  star  nodes.  If  we  let  x0  range 
through  the  .V  nodes  in  Hn ,  this  proves  the  theorem  as  well. 

Two  tree  nodes  v  and  w  are  close  if  d(u.  v),  d(u,  w)  <  where  u  is  the  lowest  common 
ancestor  of  v  and  w.  A  set  5  of  tree  nodes  is  a  close  set  if  for  any  two  nodes  v.w  €  S  there 
is  an  ordered  set  <  u,  >  of  elements  of  5  where  v.w  €  {uj  and  u,  is  close  to  ul+l  for  all  i. 
The  highest  node  of  a  close  set  is  that  node  in  the  set  at  the  lowest  depth  in  the  tree.  If  there 
are  more  than  one  at  the  lowest  depth,  the  highest  node  is  a  fixed  arbitrary  one  of  these. 

We  form  the  extended  tree  T  of  a  close  set  5  as  follows.  First  we  include  all  the  edges 
which  lie  on  the  (unique)  shortest  path  between  any  two  close  nodes  in  5.  By  the  definition 
of  a  close  set,  the  edges  between  all  pairs  of  close  nodes  form  a  tree.  We  refer  to  the  (unique) 
highest  node  of  this  tree  as  the  root  of  the  extended  tree.  Say  that  the  path  from  the  highest 
node  of  5  to  the  root  of  the  extended  tree  has  length  l.  Then  we  include  the  first  ~  —  l 
edges  on  the  (unique)  shortest  path  from  the  root  of  the  extended  tree  to  the  root  of  T.  If 
we  reach  the  root  of  T  before  traversing  £  -  /  edges,  we  stop.  These  new  edges,  the  tree 
already  defined,  and  all  incident  vertices  form  the  extended  tree  of  5.  The  extended  tree  of 
5  is  a  subtree  of  T  and  its  leaves  are  all  elements  of  S.  The  path  in  the  extended  tree  which 
contains  ancestors  of  the  highest  node  of  S  (but  not  the  highest  node  itself)  is  the  spine  of 
the  extended  tree.  The  highest  node  in  the  spine  is  the  tip  of  the  spine. 

Let  <  Si  >  be  the  indexed  family  of  maximal  close  sets  of  star  nodes.  By  calling  these 
sets  maximal  we  mean  that  the  union  of  any  two  of  them  is  not  a  close  set.  Let  <  T,  >  be 
the  associated  family  of  extended  trees. 

Lemma  3.1.  No  two  trees  T,  a ad  T}  share  a  node. 

Proof.  Any  node  u  in  an  extended  tree  has  a  close  star  node  descendent.  Thus  if  u  were 
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in  two  trees,  it  would  have  a  close  star  node  descendent  in  each  tree  and  these  nodes  would 
therefore  be  close  to  each  other.  In  that  case.  Si  U  Sj  would  be  a  close  set,  violating  our 
assumption.  | 

Lemma  3.2.  At  most  one  extended  tree  does  not  include  the  last  j  edges  along  the  shortest 
path  from  the  root  of  T  to  the  highest  node  in  the  associated  close  set. 

Proof.  Any  extended  tree  which  did  not  include  those  edges  would  include  the  root  of  T. 
By  lemma  3.1,  there  can  be  only  one  such  tree.  | 

Say  that  extended  tree  Ti  has  highest  node  t\,  contains  m,  total  star  nodes  and  that 
the  path  from  v,  to  the  root  r,  of  T,  has  length  Because  the  root  is  the  lowest  common 
ancestor  of  all  the  star  nodes  in  the  close  set,  either  both  of  its  children  have  star  node 
descendents  or  r,  is  a  star  node  itself.  If  r,  is  not  a  star  node  then  there  is  another  star  node 
v\  whose  path  to  does  not  intersect  the  path  from  t>«  to  r*.  Furthermore,  this  path  has 
at  least  U  —  1  nonstar  internal  nodes  since  otherwise  v'{  would  be  closer  to  than  is  t\.  If 
r,  is  a  star  node  then  is  also  the  highest  node  and  f,  =  0.  In  this  case  —  1  =  —  1  easily 
bounds  the  number  of  nonstar  nodes  off  the  spine.  Thus  in  the  extended  tree  there  must 
be  at  least  the  nodes  on  the  spine,  the  m,  star  nodes,  and  the  /,  —  1  nonstar  nodes  off  the 
spine  accounted  for  above.  All  other  nodes  we  denote  as  surplus;  let  s ,  be  their  count.  In 
the  extended  tree  T,  we  have  a  total  of  mi  +  /,  +  Sj  —  1  nodes  not  on  the  spine. 

Lemma  3.3.  Over  all  ways  to  choose  which  nodes  in  T  are  star  nodes ,  no  more  than 
2 3m,+4i,+3j(— t  (jjfferent  extended  trees  may  be  formed  with  highest  node  v,,  mi  total  star 
nodes,  d(vi,ri)  =  l,  and  mi  4-  /,  +  s,  —  1  nodes  off  the  spine. 

Proof.  The  nodes  on  the  spine  are  fixed  by  v,.  All  other  nodes  in  the  tree  descend  from 
the  nodes  on  the  spine.  We  can  grow  any  extended  tree  from  this  spine.  Maintain  a  queue 
of  extended  tree  nodes.  Initialize  the  queue  with  the  nodes  on  the  spine.  At  any  step,  if  the 
node  at  the  head  of  the  queue  has  children  in  T,  decide  which  of  them  are  to  be  included  in 
the  extended  tree.  Add  to  the  queue  those  children  included  in  the  extended  tree.  Once  we 
have  processed  m,  +  /,  +  s,  —  2  nodes  we  may  stop.  The  one  remaining  node  in  the  queue 
has  no  children  in  the  extended  tree. 

Each  of  the  first  /<  —  1  nodes  on  the  spine  has  at  most  one  child  not  on  the  spine.  For 
each  node  we  can  choose  whether  or  not  her  child  belongs  to  the  extended  tree.  There  are 
at  most  2l,~l  ways  to  do  this.  The  ljh  node  along  the  spine  is  the  root  of  the  extended  tree. 
Her  other  child  must  be  in  the  tree.  All  other  nodes  on  the  spine  also  have  no  choice:  their 
other  children  surely  do  not  belong  to  the  tree.  Any  node  not  on  the  spine  has  at  most  two 
children  and  thus  at  most  four  choices  for  including  her  children  in  the  tree.  Thus  the  total 
number  of  ways  to  grow  the  remainder  of  the  tree  is  no  more  than  4m,+,,+*’~2. 

Finally,  we  choose  which  of  the  nodes  in  the  extended  tree  are  star  nodes.  These  cannot 
lie  on  the  spine.  The  node  v,-  certainly  is  a  star  node.  We  must  choose  m,  —  1  of  the  other 
nodes  to  be  star  nodes.  There  are  no  more  than  2m,+l,+*,~J  ways  to  do  this. 

Many  of  these  choices  do  not  result  in  valid  extended  trees.  First,  t>,  might  not  be  the 
highest  node  of  the  extended  tree  formed.  Second,  we  might  not  choose  all  the  leaves  as  star 


nodes.  However,  we  have  overcounted  at  worst.  Multiplying  the  three  factors  bounds  the 
number  of  ways  to  specify  the  shape  of  the  extended  tree  and  to  choose  the  star  nodes  in  it. 

I 

Lemma  3.4.  Let  T,  be  an  extended  tree  with  highest  node  v,,  m,  total  star  nodes,  d(v,,  r, )  = 
l,  and  m,  +  /,  +  s,  —  1  nodes  off  the  spine.  The  probability  that  T,  results  during  the  embedding 
of  T  is  no  more  than  :ty(2~k(b  +  l))m,+/,+J,“2  if  it  does  not  contain  the  root  and  at  most 
(2~b(b  +  if  it  does. 

Proof.  No  matter  where  the  tip  of  the  spine  is  embedded,  the  coordinates  changed  along  the 
path  to  Vi  must  force  d>(t>{)  to  lie  in  the  star.  If  the  path  is  of  length  then  each  coordinate 
has  exactly  one  chance  to  change.  Thus  v*  is  equally  likely  to  be  mapped  anywhere  in 
Hn.  The  probability  i\  is  mapped  into  the  star  is  precisely  .  Otherwise,  we  bound  the 
probability  that  t\  is  a  star  node  by  1.  Deciding  how  these  coordinates  change  determines 
the  embedding  of  all  nodes  in  the  spine. 

Consider  a  path  of  length  p  <  £  from  a  node  u  to  the  star  node  v  along  which  no  other 
nodes  are  star  nodes.  Say  that  the  embedding  <p(u)  has  already  been  determined,  but  that 
the  coordinate  changes  in  all  edges  along  the  path  have  yet  to  be  decided.  There  are  pb 
coordinates  considered  along  the  path.  If  4>(u)  agrees  with  x<>  in  all  the  other  coordinates, 
we  might  choose  exactly  one  of  the  pb  path  coordinates  to  differ  from  that  of  xo  or  we  might 
choose  them  all  to  be  the  same.  If  we  choose  more  than  one  to  differ,  then  r  cannot  map 
into  the  star,  a  contradiction.  If  <f>{u)  and  Xo  do  not  agree  in  the  nonpath  coordinates,  then 
there  is  at  most  one  setting  of  the  pb  coordinates  which  would  lead  <t>(v)  to  lie  in  the  star.  In 
either  event,  the  probability  that  the  coordinates  were  chosen  in  a  way  consistent  with  the 
tree  structure  (i.e.  which  nodes  are  star  nodes)  is  at  most  Ejjr  <(2-*(6  +  i))r 

Recall  that  we  have  determined  the  embeddings  of  the  nodes  on  the  spine.  Arbitrarily 
dissect  the  remaining  edges  into  paths  of  length  at  most  £  which  end  at  star  nodes  but 
have  no  star  nodes  internal  to  them.  We  can  do  this  because  every  node  which  is  not  a 
leaf  of  the  extended  tree  has  a  star  node  at  most  distance  j  below  it.  (A  leaf  which  is 
not  a  star  node  is  not  on  a  close  path  between  star  nodes  and  thus  would  not  have  been 
included  in  the  extended  tree.)  We  can  then  analyze  the  coordinate  changes  which  occur  in 
the  subtree  below  any  node  in  the  spine.  If  we  start  with  the  node  on  the  spine,  then  we  can 
consider  the  paths  in  turn  so  that  the  embedding  of  the  first  node  in  the  path  has  already 
been  determined  but  nothing  else  along  the  path  has  been  determined.  Coordinate  changes 
along  any  path  are  independent  of  what  has  been  determined  before.  One  bound  on  the 
probability  that  the  coordinate  changes  in  the  paths  actually  result  in  the  mapping  of  all 
star  nodes  into  the  star  is  the  product  of  the  bounds  on  the  probabilities  for  the  individual 
paths.  The  sum  of  the  lengths  of  all  the  paths  equals  the  number  of  edges  not  in  the  spine. 
Thus  we  may  use  the  upper  bound  found  in  the  previous  paragraph  to  bound  the  probability 
that  all  coordinate  changes  produce  the  desired  structure  of  the  tree.  This  probability  is  no 
more  than  (2“4(6  +  l  ))"»•+'.+'.-*.  | 

We  may  now  bound  the  probability  that  certain  structures  involving  star  nodes  arise 
during  the  embedding. 
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Lemma  3.5.  Fix  t.  the  number  of  extended  trees  formed  by  the  dose  sets  of  star  nodes  and 
index  the  trees  arbitrarily  as  <  T,  > .  Choose  a  node  v,  to  be  the  highest  node  of  T,  and  m, 
to  be  the  number  of  star  nodes  in  Tt.  Let  m  =  £,  m»-  Further,  choose  l,  and  s,  for  T,.  Then 
if  we  use  the  algorithm  shown  above  to  embed  T  in  Hn,  the  probability  that  the  resulting 
extended  trees  have  the  parameters  chosen  is  at  most 

(ILtiy-1  (22fc-7(6  +  l)-2)‘(23-4(fc  +  l))m  H(2A-b(b  +  l))''(23-6(6  +  I))*1 


Proof.  Fix  a  particular  family  of  extended  trees  satisfying  the  chosen  parameters.  As  T 
is  embedded,  we  know  by  lemma  3.1  that  the  choices  made  on  the  edges  of  any  tree  are 
independent  of  the  choices  for  other  trees.  Thus  to  bound  the  probability  that  the  family 
of  extended  trees  arises  from  the  embedding,  we  need  only  multiply  the  probability  bounds 
for  the  individual  trees.  This  product  is  at  most  (njr)*-1  ni(2~6(&+  l))m,+i,+,*~2  by  lemmas 
3.2  and  3.4. 

The  number  of  different  families  of  trees  <  Ti  >  satisfying  the  chosen  parameters  is  at 
most  nt23m.+4<.  +3*.-t  by  lemma  3.3.  The  probability  that  the  extended  trees  arising  from 
the  embedding  satisfy  the  chosen  parameters  does  not  exceed  the  product  of  the  number  of 
different  families  and  the  bound  on  the  probability  that  any  given  family  arises.  | 


Lemma  3.6.  Ifb>8.  then  the  probability  that  an  embedding  ofT  results  in  a  family  of  t 
extended  trees  in  which  each  T,  contains  m,  star  nodes  and  has  highest  node  v,  is  at  most 

(nliy-1  (23i~7(b  +  1)~2)‘(23-A(6  4-  l))m22< 

Proof.  Let  £<(i>  indicate  the  sum  over  all  possible  integral  values  of  and 

similarly  for  £<,,>•  An  upper  bound  on  the  desired  probability  comes  from  summing  the 
bounds  in  lemma  3.5. 

L  £  ( (^V"1  (226-7(6  +  +  l))m  IR2^6  +  l))/*(23-fc(6  +  l))-] 

~  (n  IT  (22k-7(&+  l)'2)‘(23_i(f>+  l))m  e  e  n<*-‘  (6+ l))'*(23-k(6 +1))* 

'  ^ '  '  <i,>  <».>  i 

=  (nil)'’1  (2a*-7(6+ i)-a),(23-k(*+  urn 

=  (— )‘’l(226-  7  (  6  +l)-2)‘(23-4(6  +  l)rn  ^(24-k(6  +  l))'*j  ^(23-k(fe+  1))*' 

<  (^)“l  (2”-7(6  4-  l)“2)‘(23"k(6  4-  l)r  II  (t)  (1) 

<  ^!L±iy',(22>-7(6+  l)-2l‘(23-*(6  4-  l)r22t 
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Lemma  3.7.  Ifb>$.  (hen  the  probability  that  an  embedding  of  T  results  in  m  star  nodes 
contained  in  a  family  of  t  extended  trees  is  at  most 


m  -  1 
t  -  1 


n  4-  1 


i-i 


(226_r(  6  -t-  1  )-2)'(2j"'i  b  1 1 )m22! 


Proof.  The  first  factor  represents  the  number  of  ways  to  choose  the  highest  node  in  each 
tree.  Order  does  not  matter  when  we  choose  the  highest  nodes,  but  we  must  then  assign 
an  arbitrary  order  to  distinguish  between  the  different  extended  trees.  The  second  factor 
represents  the  number  of  ways  to  apportion  the  m  star  nodes  among  the  extended  trees  so 
that  each  tree  receives  at  least  one  star  node.  The  remaining  terms  come  from  lemma  3.6.  | 


Lemma  3.8.  Fix  b  >  8  and  let  c  be  a  sufficiently  large  constant.  The  probability  that  a 
dynamic  embedding  of  T  in  Hn  results  in  cn  star  nodes  is  at  most 


Proof.  Examine  the  bound  found  in  lemma  3.7.  The  first  two  terms  are  easily  bounded  by 
and  respectively.  Combining  like  terms  reduces  the  bound  to 


/  ce222b~sn(  n  4- 
\  (6  4-  l)2*2 


(23-*(6  +  l))en 


Using  elementary  calculus,  we  find  that  for  sufficiently  large  c.  the  first  term  is  optimized 

when  t2  =  g2  -  Thus,  over  all  values  of  t ,  the  largest  the  first  term  can  be  is  (e^-  )c*  n. 

We  then  add  this  bound  over  the  cn  possible  values  of  t  to  finish  the  proof.  | 

Examine  this  last  bound  on  the  probability  that  there  are  cn  star  nodes.  As  we  increase 
c.  the  third  term  eventually  dominates  the  other  two  terms.  This  allows  us  to  force  the 
probability  to  be  as  small  a  power  of  N  as  we  desire.  This  proves  the  theorem. 

We  know  that,  with  high  probability,  no  more  than  cn  tree  nodes  are  assigned  to  any 
star  in  the  hypercube.  What  remains  is  to  distribute  them  throughout  the  star  so  that  no 
node  has  more  than  a  constant  number  of  tree  nodes  assigned  to  it.  In  the  final  version 
of  this  paper  we  will  show  how  to  remove  some  nodes  from  each  star  so  that  the  partial 
stars  remaining  cover  the  nodes  of  the  hypercube.  Further,  each  node  is  covered  by  exactly 
one  partial  star  and  each  partial  star  haw  at  least  |  nodes.  The  embedding  is  redistributed 
(dynamically)  as  follows.  The  first  2c  tree  nodes  assigned  to  a  hypercube  node  x  are  stored 
there.  Any  further  nodes  which  the  algorithm  tries  to  store  at  x  are  sent  to  some  other  node 
in  x’s  partial  star  containing  fewer  than  2c  tree  nodes.  Such  a  node  always  exists  since  there 
are  no  more  than  cn  tree  nodes  in  the  partial  star  at  any  time.  This  adds  at  most  two  to 
the  dilation  of  the  embedding. 

Now  suppose  that  a  tree  node  which  was  redistributed  in  the  star  wants  to  grow  a  child  in 
the  tree.  Rather  than  growing  a  path  in  the  hypercube  from  the  node  in  which  it  is  actually 
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stored,  the  path  is  grown  from  the  hypercube  node  which  was  originally  supposed  to  contain 
the  tree  node.  This  may  add  another  two  to  the  dilation,  but  it  is  important  for  the  analysis 
that  the  path  continues  to  grow  randomly  from  the  end  of  the  last  random  path. 

The  load- balancing  adjustments  at  both  ends  of  the  path  may  add  up  to  4  to  the  dilation, 
resulting  in  a  path  length  which  is  at  most  12.  Since  each  tree  edge  is  mapped  to  a  path  of 
length  no  greater  than  12,  the  dilation  of  the  embedding  is  constant. 

Finally,  consider  a  tree  T  which  grows  and  shrinks.  Any  form  taken  by  T  has  been 
embedded  according  to  the  algorithm  given  above.  Since  there  are  only  a  polynomial  number 
of  such  trees,  the  theorem  holds  in  this  case  as  well. 


4  A  Lower  Bound  for  Deterministic  Algorithms 


The  following  theorem  shows  that  determinstic  algorithms  perform  poorly  on  this  problem. 
We  prove  that,  even  if  we  restrict  our  attention  to  trees  which  only  grow,  any  deterministic 
embedding  algorithm  which  maintains  load  c  must  have  not  only  maximum  but  average 

dilation  fl(  -■ ).  A  consequence  of  this  is  that  any  embedding  algorithm  which  maintains 
constant  load  must  necessarily  have  dilation  fi(  v'log  ;V). 


Theorem:  Any  deterministic  algorithm  for  dynamically  embedding  trees  in  the  hvpercube 
which  achieves  load  c  must  have  average  edge  length 

Proof:  Let  c  be  the  load  maintained  by  the  embedding  algorithm.  Define  the  sire  of  a  node 
in  the  hvpercube  to  be  the  number  of  l's  in  the  node's  string.  Partition  :he  hypercube  into 
6c  levels,  each  level  corresponding  to  some  range  of  node  sizes  and  containing  £  nodes.  Since 

there  are  at  most  0(  -rv  »i.)  nodes  of  anv  size,  each  level  must  contain  at  least  Q(  ^lo.g-S  )  sizes. 

This  means  that  any  two  nodes  which  are  in  non-adjacent  levels  are  at  distance  fl(  ) 

from  each  other. 


Grow  a  path  of  y  nodes,  starting  at  the  root.  Then  some  level  must  contain  yy  tree 
nodes;  choose  such  a  level.  We  will  continue  growing  the  tree  from  the  yy  nodes  in  the 
chosen  level.  Grow  paths  from  each  of  these  tree  nodes  simultaneously,  stopping  each  path's 
growth  when  it  reaches  a  hypercube  node  which  is  neither  in  the  chosen  level  nor  in  a  level 
adjacent  to  it.  The  total  number  of  nodes  in  the  chosen  level  and  adjacent  levels  is  at  most 
y.  It  follows  that  the  total  length  of  the  y^  paths  is  at  most  y  —  the  number  of  tree  nodes 
which  can  be  stored  in  just  these  levels.  This  verifies  that  the  tree  being  considered  has  at 
most  y  nodes. 


Now  we  cam  calculate  the  average  edge  length.  Since  each  of  the  yy  paths  connects  a 
node  in  the  chosen  level  to  a  node  in  some  non-adjacent  level,  the  total  edge  length  in  these 

paths  is  at  least  yyf^-^^y-^)  =  Q(  ,VV^£-^ ),  Since  the  entire  tree  contains  at  most  A'  nodes. 

it  follows  that  the  average  edge  length  of  the  embedding  is  0(  V ) •  | 
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